Search CORE

21 research outputs found

Software defect prediction based on association rule classification.

Author: Baesens Bart
Baojun Ma
Dejaeger Karel
Vanthienen Jan
Publication venue
Publication date
Field of study

In software defect prediction, predictive models are estimated based on various code attributes to assess the likelihood of software modules containing errors. Many classification methods have been suggested to accomplish this task. However, association based classification methods have not been investigated so far in this context. This paper assesses the use of such a classification method, CBA2, and compares it to other rule based classification methods. Furthermore, we investigate whether rule sets generated on data from one software project can be used to predict defective software modules in other, similar software projects. It is found that applying the CBA2 algorithm results in both accurate and comprehensible rule sets.Software defect prediction; Association rule classification; CBA2; AUC;

Research Papers in Economics

Software effort prediction using regression rule extraction from neural networks.

Author: Baesens Bart
Dejaeger Karel
Martens David
Setiono Rudy
Verbeke Wouter
Publication venue
Publication date
Field of study

Research Papers in Economics

Essays on empirical software engineering.

Author: Dejaeger Karel
Publication venue
Publication date: 05/12/2012
Field of study

status: publishe

Lirias

Software requirements engineering: een iteratieve aanpak

Author: Dedene Guido
Dejaeger Karel
Poelmans Jonas
Publication venue: Sdu Uitgevers bv
Publication date: 01/01/2011
Field of study

status: publishe

Lirias

Towards comprehensible software fault prediction models using Bayesian network classifiers

Author: Baesens Bart
Dejaeger Karel
Verbraken Thomas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Software testing is a crucial activity during software development and fault prediction models assist practitioners herein by providing an upfront identification of faulty software code by drawing upon the machine learning literature. While especially the Naive Bayes classifier is often applied in this regard, citing predictive performance and comprehensibility as its major strengths, a number of alternative Bayesian algorithms that boost the possibility to construct simpler networks with less nodes and arcs remain unexplored. This study contributes to the literature by considering 15 different Bayesian Network (BN) classifiers and comparing them to other popular machine learning techniques. Furthermore, the applicability of the Markov blanket principle for feature selection, which is a natural extension to BN theory, is investigated. The results, both in terms of the AUC and the recently introduced H-measure, are rigorously tested using the statistical framework of Demsar. It is concluded that simple and comprehensible networks with less nodes can be constructed using BN classifiers other than the Naive Bayes classifier. Furthermore, it is found that the aspects of comprehensibility and predictive performance need to be balanced out, and also the development context is an item which should be taken into account during model selection

Southampton (e-Prints Soton)

Software requirements engineering: een iteratieve aanpak.

Author: Dedene Guido
Dejaeger Karel
Poelmans Jonas
Publication venue
Publication date
Field of study

Research Papers in Economics

A multidimensional analysis of data quality for credit risk management: new insights and challenges

Author: Baesens Bart
Dejaeger Karel
Lemahieu Wilfried
Moges H.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2013
Field of study

Interest in group moods as an emergent phenomenon of group members’ interactions has significantly increased over the past two decades (Barsade & Gibson, 2007). Most studies focused particularly on understanding the effects of group moods on group processes (Barsade, 2001, Baartel & Saavedra, 2000, Barsade, Ward, Turner & Sonnenfled, 2000, Chiayu Tu, 2009) and group performance (Seung -Yoon Ree, 2006, Jordan, Lawrence & Troth, 2006). However, research investigating the antecedents of group moods is still scant. The current study fills this gap by focusing on the affective potential of group conflict. In this sense, group conflict focuses on how differences of opinion (task conflict) and person-related disagreements (relationship conflict) trigger group moods that differ in their valence (positive and negative) and level of activation (activated and unactivated) (Baartel & Saavedra, 2000). In this context, the group’s ability to define and understand its moods, their cause, evolution and relations between them - ability known as group emotional intelligence (Salovey & Mayer, 1990) - is expected to buffer the relation between conflict and group moods. By studying group moods in relation to group conflict, the current study extends previous research by considering group moods’ antecedents and not only their consequences. This contributes to a better understanding of group affect dynamics. In addition, the current study investigates different nuances of group moods given by different types of conflict. Whether an affect has a positive or negative valence, or whether it is activated or inactivated, has implications upon the further group dynamics

Vlerick Repository

Data mining techniques for software effort estimation: a comparative study

Author: Baesens Bart
Dejaeger Karel
Martens David
Verbeke Wouter
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

A predictive model is required to be accurate and comprehensible in order to inspire confidence in a business setting. Both aspects have been assessed in a software effort estimation setting by previous studies. However, no univocal conclusion as to which technique is the most suited has been reached. This study addresses this issue by reporting on the results of a large scale benchmarking study. Different types of techniques are under consideration, including techniques inducing tree/rule based models like M5 and CART, linear models such as various types of linear regression, nonlinear models (MARS, multilayered perceptron neural networks, radial basis function networks, and least squares support vector machines), and estimation techniques that do not explicitly induce a model (e.g., a case-based reasoning approach). Furthermore, the aspect of feature subset selection by using a generic backward input selection wrapper is investigated. The results are subjected to rigorous statistical testing and indicate that ordinary least squares regression in combination with a logarithmic transformation performs best. Another key finding is that by selecting a subset of highly predictive attributes such as project size, development, and environment related attributes, typically a significant increase in estimation accuracy can be obtaine

Lirias

Southampton (e-Prints Soton)

Institutional Repository Universiteit Antwerpen

Software defect prediction based on association rule classification

Author: Baesens Bart
Baojun Ma
Dejaeger Karel
Vanthienen Jan
Publication venue: K.U.Leuven - Faculty of Business and Economics
Publication date: 01/02/2011
Field of study

Lirias

De kosten van software-ontwikkeling voorspellen.

Author: Baesens Bart
Dejaeger Karel
Martens David
Verbeke Wouter
Publication venue
Publication date
Field of study

Om de kosten van softwareontwikkeling in te schatten kunnen dataminingtechnieken worden gebruikt. Recentelijk zijn de formele modellen op de achtergrond geraakt ten gunste van de - objectievere - dataminingtechnieken. Uit onderzoek naar het gebruik van technieken als regressie, neurale netwerken en beslissingsbomen op vier softwareontwikkeldatasets blijkt dat de eenvoudige technieken het beste scoren.

Research Papers in Economics